Overview

Dataset Statistics

Number of Variables 12
Number of Rows 1539
Missing Cells 3261
Missing Cells (%) 17.7%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 410.0 KB
Average Row Size in Memory 272.8 B
Variable Types
  • Categorical: 7
  • Numerical: 5

Dataset Insights

ID is uniformly distributed Uniform
CapacityFactor has 1539 (100.0%) missing values Missing
Generation(kWd) has 1539 (100.0%) missing values Missing
Temp(°Cd) has 183 (11.89%) missing values Missing
Angle is skewed Skewed
Capacity is skewed Skewed
Irradiance(kWd/m2) is skewed Skewed
Date has a high cardinality: 112 distinct values High Cardinality
Set has constant value "test" Constant
Set has constant length 4 Constant Length
Date has constant length 10 Constant Length
CapacityFactor has all distinct values Unique
Generation(kWd) has all distinct values Unique
Angle has 550 (35.74%) negatives Negatives
Angle has 332 (21.57%) zeros Zeros
  • 1
  • 2

Variables


Set

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 106191

Length

Mean 4
Standard Deviation 0
Median 4
Minimum 4
Maximum 4

Sample

1st row test
2nd row test
3rd row test
4th row test
5th row test

Letter

Count 6156
Lowercase Letter 6156
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Set has words of constant length

ID

numerical

Approximate Distinct Count 1539
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 24624
Mean 770
Minimum 1
Maximum 1539
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ID is uniformly distributed

Quantile Statistics

Minimum 1
5-th Percentile 77.9
Q1 385.5
Median 770
Q3 1154.5
95-th Percentile 1462.1
Maximum 1539
Range 1538
IQR 769

Descriptive Statistics

Mean 770
Standard Deviation 444.4153
Variance 197505
Sum 1.185e+06
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.5772
  • ID is not normally distributed (p-value 0.001707976880722162)

Date

categorical

Approximate Distinct Count 112
Approximate Unique (%) 7.3%
Missing 0
Missing (%) 0.0%
Memory Size 115425

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2021-10-29
2nd row 2021-10-29
3rd row 2021-10-29
4th row 2021-10-29
5th row 2021-10-29

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 3078
Decimal Number 12312
  • Date has words of constant length

Lat

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Memory Size 107944

Length

Mean 5.1391
Standard Deviation 0.3461
Median 5
Minimum 5
Maximum 6

Sample

1st row 24.98
2nd row 25.11
3rd row 25.11
4th row 25.03
5th row 24.107

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 6370

Lon

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Memory Size 109158
  • The largest value (120.52) is over 1.97 times larger than the second largest value (121.26)

Length

Mean 5.9279
Standard Deviation 0.2588
Median 6
Minimum 5
Maximum 6

Sample

1st row 121.03
2nd row 121.26
3rd row 121.26
4th row 121.08
5th row 120.44

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 7584
  • The largest value (12052) is over 1.97 times larger than the second largest value (12126)

Angle

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 24624
Mean -17.3875
Minimum -160
Maximum 22
Zeros 332
Zeros (%) 21.6%
Negatives 550
Negatives (%) 35.7%
  • Angle is skewed left (γ1 = -2.1267)

Quantile Statistics

Minimum -160
5-th Percentile -160
Q1 -2.62
Median 0
Q3 4.63
95-th Percentile 22
Maximum 22
Range 182
IQR 7.25

Descriptive Statistics

Mean -17.3875
Standard Deviation 47.8469
Variance 2289.3255
Sum -26759.33
Skewness -2.1267
Kurtosis 3.277
Coefficient of Variation -2.7518
  • Angle is not normally distributed (p-value 1.6081017169036507e-17)
  • Angle has 440 outliers

Module

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 124222
  • The largest value (AUO PM060MW3 320W) is over 3.01 times larger than the second largest value (MM60-6RT-300)

Length

Mean 15.716
Standard Deviation 2.0826
Median 17
Minimum 12
Maximum 17

Sample

1st row SEC-6M-60A-295
2nd row MM60-6RT-300
3rd row MM60-6RT-300
4th row MM60-6RT-300
5th row AUO PM060MW3 320W

Letter

Count 10664
Lowercase Letter 0
Space Separator 2198
Uppercase Letter 10664
Dash Punctuation 992
Decimal Number 10333
  • The top 2 categories (AUO PM060MW3 320W, MM60-6RT-300) take over 50.0%

Capacity

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 24624
Mean 335.6543
Minimum 99.2
Maximum 499.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Capacity is skewed left (γ1 = -0.3129)

Quantile Statistics

Minimum 99.2
5-th Percentile 99.2
Q1 267.52
Median 314.88
Q3 492.8
95-th Percentile 499.8
Maximum 499.8
Range 400.6
IQR 225.28

Descriptive Statistics

Mean 335.6543
Standard Deviation 132.4862
Variance 17552.588
Sum 516572
Skewness -0.3129
Kurtosis -0.8659
Coefficient of Variation 0.3947
  • Capacity is not normally distributed (p-value 5.6062801895246684e-14)

CapacityFactor

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 104652

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 4617
Lowercase Letter 4617
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • CapacityFactor has words of constant length

Generation(kWd)

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 104652

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 4617
Lowercase Letter 4617
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Generation(kWd) has words of constant length

Irradiance(kWd/m2)

numerical

Approximate Distinct Count 294
Approximate Unique (%) 19.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 24624
Mean 3.3784
Minimum 0.2611
Maximum 5.6111
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWd/m2) is skewed left (γ1 = -0.5668)

Quantile Statistics

Minimum 0.2611
5-th Percentile 0.7556
Q1 2.2778
Median 3.8667
Q3 4.525
95-th Percentile 5.1194
Maximum 5.6111
Range 5.35
IQR 2.2472

Descriptive Statistics

Mean 3.3784
Standard Deviation 1.3991
Variance 1.9573
Sum 5199.4056
Skewness -0.5668
Kurtosis -0.9353
Coefficient of Variation 0.4141
  • Irradiance(kWd/m2) is not normally distributed (p-value 3.5009769164836956e-08)

Temp(°Cd)

numerical

Approximate Distinct Count 131
Approximate Unique (%) 9.7%
Missing 183
Missing (%) 11.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 21696
Mean 18.8634
Minimum 12.3
Maximum 28
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Temp(°Cd) is skewed right (γ1 = 0.7031)

Quantile Statistics

Minimum 12.3
5-th Percentile 14.4
Q1 16.6
Median 18
Q3 20.7
95-th Percentile 25
Maximum 28
Range 15.7
IQR 4.1

Descriptive Statistics

Mean 18.8634
Standard Deviation 3.3494
Variance 11.2188
Sum 25578.8
Skewness 0.7031
Kurtosis -0.1952
Coefficient of Variation 0.1776
  • Temp(°Cd) is not normally distributed (p-value 0.004936531077023483)
  • Temp(°Cd) has 24 outliers

Interactions

Correlations

Missing Values